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ON THE RANDOM GREEDY E-FREE HYPERGRAPH PROCESS 


DANIELA KUHN, DERYK OSTHUS AND AMELIA TAYLOR 


Abstract. Let E be a strictly fc-balanced fc-uniform hypergraph with e(E) > \F\ — k + 1 and 
maximum co-degree at least two. The random greedy E-free process constructs a maximal E-free 
hypergraph as follows. Consider a random ordering of the hyperedges of the complete fc-uniform 
hypergraph K k on n vertices. Start with the empty hypergraph on n vertices. Successively 
consider the hyperedges e of K k in the given ordering, and add e to the existing hypergraph 
provided that e does not create a copy of E. We show that asymptotically almost surely 
this process terminates at a hypergraph with 0(n k ~^ F ^~ k ^^ F ^~ 1 ' > ) hyperedges. This is best 
possible up to logarithmic factors. 


1. Introduction 

1.1. Results. Fix a fc-uniform hypergraph F. In this paper, we study the following random 
greedy process, which constructs a maximal E-free fc-uniform hypergraph. Assign a birthtime 
which is uniformly distributed in [0, 1] to each hyperedge of the complete fc-uniform hypergraph 
on n vertices. Start with the empty hypergraph on n vertices at time p = 0. Increase p 
and each time that a new hyperedge is born, add it to the hypergraph provided that it does not 
create a copy of E (edges with equal birthtime are added in any order). Denote the resulting 
hypergraph at time p by R n , P - 

The random greedy graph process (i.e. the case when fc = 2) has been studied for many 
graphs. The initial motivation (see for example 0 ) was to study the Ramsey number E(3,t). 
Indeed, the best current lower bounds on R(3,t) were obtained via the study of the triangle- 
free process m, (IQ]). Osthus and Taraz ED gave an upper bound on the number of edges in 
the graph R U} i when E is strictly 2-balanced, showing that a.a.s. R n ,\ has maximum degree 
0(n 1- ^ l -2 )/( e (- F ) -1 )(log (Here a.a.s. stands for ‘asymptotically almost surely’, 

i.e. for the property that an event occurs with probability tending to one as n tends to in¬ 
finity.) Results for the cases when E = C 4 and E = K 4 were obtained independently by 
Bollobas and Riordan [7j. Bohman and Keevash [3j showed that a.a.s. R n ,\ has minimum de¬ 
gree D(?r 1_ ^l _2 ^( e (' F ^ 1 )(logn) 1 /^^- 1-1 ^) whenever E is strictly 2-balanced and conjectured 
that this gives the correct order of magnitude. Improved upper bounds have been obtained 
for some graphs. For instance, the number of edges has been determined asymptotically when 
E is a cycle ([3j, [5], [TO] , 12], p3]) and when E = K 4 ([15], [16]). Picollelli [13J determined 
asymptotically the number of edges when E is a diamond, i.e. the graph obtained by removing 
one edge from K 4 . Note that this graph is not strictly 2-balanced. 

Much less is known about the process when E is a fc-uniform hypergraph and fc > 3. The 
only known upper bound is due to Bohman, Mubayi and Picollelli [6], who studied the E-free 
process when E is a fc-uniform generalisation of a graph triangle (with an application to certain 
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Ramsey numbers). In this paper, we obtain a generalisation of the upper bound in jTTj to strictly 
/c-balanced hypergraphs. Here we say that a /c-uniform hypergraph F is strictly k-balanced if 
|F| > /c + 1 and for all proper subgraphs F' C F with |I ?, | > k + 1 we have 

e{F) - 1 e(F') - 1 

\F\-k \F'\ — k ' 

We also need the following definition. Given a hypergraph H and i € N, we define the maximum 
i-degree of H by 

A i(H) := ma x{d H (U) : U C V(H), \U\ = i}, 
where dn(U) is the number of hyperedges in H containing U. 

Theorem 1.1. Let k € N be such that k > 2. Let F be a strictly k-balanced k-uniform hypergraph 
which has v vertices and h > v — k + 1 hyperedges. Suppose A^,_i (F) > 2. Then there exists a 
constant c such that a.a.s. 

. v — k _ 3 _ 1 

(1) Afc_ 1 (i? rij i) < t where t := cn~ 7rrT (log n) Afc - l(F J -1 h_1 . 

In particular, a.a.s. R n ,i has at most hyperedges. 

Note that Theorem 11.11 applies. for example, to all /c-uniform cliques K k on v > k + 1 vertices 
and more generally to all balanced complete Gpartite /c-uniform hypergraphs with I > k and 
more than k vertices. 

Bennett and Bohman [2] studied a random greedy independent set algorithm in certain quasi¬ 
random hypergraphs. Their result can be applied in the context of the F-free process to show 
that if F is a strictly /c-balanced /c-uniform hypergraph and every vertex of F lies in at least 
two hyperedges, then a.a.s. R. n . i has (logn) 1 ^ 6 ^' hyperedges. Up to 

logarithmic factors, this matches the upper bound given in Theorem ll.il 

1.2. An open question. There are many natural open questions related to the random greedy 

F-free process. Here we discuss bounds on the number of edges in R Uj \ when F is an Gcycle. 
Theorem 11.11 applies in the case when F is a /c-uniform tight cycle. However, there are other 
natural notions of a hypergraph cycle: Given I € N with I < k, we say that a /c-uniform hyper¬ 
graph Cf j, is an l-cycle of length h if there is a cyclic ordering of its vertices x \,..., x h ( k _^ and a 
corresponding ordering on its hyperedges eo, ■ • •, eh -1 such that e* = ..., 

So consecutive hyperedges on the cycle intersect in exactly I vertices. The case when i = k — 1 
corresponds to Cfy, being a tight cycle of length h. It is easy to check that all Gcycles are 
strictly /c-balanced, but only tight cycles satisfy the co-degree condition in Theorem 11.11 In the 
case when I > /c/2, Gcycles meet the conditions in [2j. We conjecture that the bound on the 
number of hyperedges in ( 2 j is of the correct magnitude for any I. 

Conjecture 1.2. Let £,k £ N be such that k >2 and k > I and let F := Ci t h be the l-cycle of 

he i 

length h. Then a.a.s. R U: i has ©(n ' 1 - 1 (log n) h ~ x ) hyperedges. 

One motivation for Conjecture 11.21 is that p = 77 ,^/ x ) (log 77 ,) x /(*• ^ is the threshold for 
the property that every hyperedge in H n)P lies in an Gcycle of length h. 

1.3. Sketch of the argument. Rather than studying the random greedy process itself, we 
are able to prove Theorem 11.11 by obtaining precise information about the random binomial 
hypergraph H np . (This idea was first used in [TTj.) More precisely, write H UtP for the random 
binomial /c-uniform hypergraph on n vertices with hyperedge probability p , i.e., each hyperedge 
is included in H n p with probability p, independently of all other hyperedges. We write H~ p 





ON THE RANDOM GREEDY F-FREE HYPERGRAPH PROCESS 


3 


for the hypergraph formed by removing all copies of F from H UiP . Note that H np can also be 
viewed as the random hypergraph consisting of all hyperedges with birthtime at most p. Thus, 
for all p € [0,1] we have 

H n p C R np C R n \. 

We will always assume that K f r ) , H n<p , H~ p and R n:P use the vertex set [n]. 

In Section El we collect some large deviation inequalities. The proof of Theorem 11.11 is given 
in Section [3l the strategy is as follows. We first identify the largest point p where we can still 
use H U)P to approximate the behaviour of H~ p (i.e. for this p, only a small proportion of edges 
of H ntP he in a copy of F). Now let U be a set of k — 1 vertices in F such that cIf(U) = A^_i(l ? ). 
Let F be the subgraph of F obtained by deleting all those hyperedges which contain U. Let t 
be as in Suppose for a contradiction that there exists a (k — l)-set V of degree t in R, n j 

and let T be the neighbourhood of V in R n \. We will show that in this case we would almost 
certainly find a copy a of F in H~ p [T U V] which maps U to V. Since H~ p C R n ^i, a would 
also be a copy of F in R n j [T U V] which maps U to V. But this actually yields a copy of F in 
R n ,i, a contradiction. So a.a.s. Ak-i(R n ,i) < t. It is perhaps surprising that for our analysis 
the order of hyperedges added after this critical point p is irrelevant. 

2. Tools 

Let S be a collection of subsets of E(K^). For each a£<S, let I Q denote the indicator variable 
which equals one if all hyperedges in a lie in H UtP and zero otherwise. Set 

X-.= ^I a and p:=E[X}. 
a£S 

Let Y be the size of a largest hyperedge-disjoint collection of elements of S in H n ^ p (i.e. the 
maximum size of a set S' C S such that I a = 1 for all a € S' and a n a’ = 0 for all distinct 
a, a' € S'). Erdos and Tetali [9] proved the following upper tail bound on Y. 

Theorem 2.1. [9]. For every a € N, we have P[T > a] < (ep/a) a . 

We also require a lower tail bound on Y. For all a, a' € S with a ^ a' , we write a ~ ol if 
a fl a' / 0. Define 

A := ^ E [Iala'), 
a! 

where the sum is over all ordered pairs a! ~ a in S. Also, let 

r] := maxE[/ a ] and v := max > E[/ a /]. 

aG«S olSLS ^ J 

a! $LS\ot! <^>a. 

The following bound follows from Lemma 4.2 in Chapter 8 and Theorem A. 15 in [lj, see Em- 

A £ 2 

Theorem 2.2. Let e > 0. Then P[T < (1 — e)p] < e ^ e ^ u+ 2 _ 

3. Proof of Theorem 11.11 

3.1. Basic parameters. Let F be a strictly fc-balanced fc-uniform hypergraph which has v 
vertices and h hyperedges. Let d := Af c _i(F) and choose positive constants ci,C 2 satisfying 

1/n 1/ci <C l/c 2 <1 /d, l/h. 
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(Here the notation a<t means that we can find an increasing function / for which all of the 
conditions in the proof are satisfied whenever a < /(£>).) Given functions / and g, we will write 
/ = 0 (g) if there exists a constant c such that /(n) < (log n) c g(n) for all sufficiently large n. 
Set 

Here p is chosen to be as large as possible subject to the constraint that a.a.s. only a small 
proportion of the hyperedges of H n ^ p lie in a copy of F. For each k + 1 < i < v, we define 

K := max{e(F') : F' C F, \F'\ = *}. 

Since F is strictly ^-balanced, we have 

h — 1 hi — 1 
v — k > i-k ‘ 

So for each + 1 < z < v we can define a positive constant 


5i := i — k — (hi - 1 )j—j > 0. 


( 2 ) 

Let 

5 := min{(5j : k + 1 < i < v}. 
We will often use that for k + 1 < i < v 


(3) 


-i h-hi ^ v-i-%=$(h-hi) © v-i-Z=$(h- (fe-1)) —Si ^ S 

n 1 < n h-— u h-v-k v JJ =n x <n 


v- «, .< 

n p 


Note that this bounds the expected number of extensions of a fixed subgraph of F on i vertices 
into copies of F in H UtP . 

3.2. Many copies of F containing a fixed hyperedge. For a given hyperedge / £ E(K^), 
an (r, /) -cluster is a collection F\, Fo ,..., F r of r copies of F such that each Fj contains / and 
for each 1 < i < r, there exists /* € E(Fi ) such that ^ E(Fj ) for any j < L Define to be 
the event that H n ^ p has no (logn, /)-cluster for any hyperedge /. We will bound the probability 
of A c , i.e., the probability that H np has a (logn,/)-cluster for some hyperedge /. 


Lemma 3.1. We have P[„4 C ] < n k . 

Proof. Fix some / € E(K k ). Write Z r j for the number of (r,/)-clusters in H 1ltP , so Z\ j 
counts copies of F which contain the hyperedge /. There are h hyperedges in F which could be 
mapped to /, so 

E [Z 1>f \ < hn v ~ k p h < e~ 2k 

with room to spare. Let r < logn and consider a fixed (r,/)-cluster C in H np . Let Zq be the 
number of (1, /)-clusters in H UtP which contain at least one hyperedge which does not lie in C, 
so each of these (1, /)-clusters together with C forms an (r + 1, /)-cluster. Suppose that a is a 
(1, /)-cluster sharing k + 1 < i < v vertices with C. The set of hyperedges shared by a and C 
form a proper subgraph of F on i vertices. Since F is strictly fc-balanced, a and C can have at 
most hi common hyperedges. This allows us to estimate E[Zc] as 

v © 

E [Z c \ < hn v ~ k p h ~ 1 + ^ v i (rv) i - k n v - i p h - hi < e~ 3k + 0(n~ s ) < e~ 2k . 

i=k -\-1 

If we sum over all (r, /)-clusters in K k , we find that 

E[Z r+1<f ] < E [Z rJ ]e~ 2k < e ~ 2 ( r+1)k 
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and hence E[Z\ ogn j] < n 2k . By summing over all / € E(K k ), we obtain 

F[_4. c ] < ff) rr 2k < n ~ k , 

as required. □ 


3.3. Estimating the number of extensions of a fixed set. Recall that d = A k-i(F). Let 
U = {u\,U 2 , ■ ■ ■ ,Uk- 1 } C V(F) be such that d^(f7) = d. Let Np{U) denote the neighbourhood 
of U in F. i.e. Np(U) := {x £ V(F) : U U {:r} £ E(F)}. Define F C F which has vertex set 
V(F) and all hyperedges / £ E(F) such that |/nf7| < k — 2. Fix T C [n] of size t and an ordered 
sequence V = (vi,V 2 , ■ ■ ■ ,Dc-i) of distinct vertices, where V{ £ [n] \ T for each 1 < i < k — 1. 
Given a hypergraph H C K k , let S(H) = S(H,T,V ) be the set of all copies of F in H such 
that the following hold: 

• for each 1 < i < k — 1, Ui is mapped to vf, 

• Np(U ) is mapped into T and 

• V(F) \ Np(U) is mapped into [n] \ T. 

We let X := \S(H n , p )\ and X~ := |<S(iL“ p )|. Note that X~ < X since H~ p C H n , p . 

Note that if T C Nn n l (V), then S(R nt 1 ) = 0, as otherwise we could find a copy of F in R n ,i- 
Since H~ p C R n 1 , it follows that X~ = 0. So, in order to prove Theorem o it will suffice to 
prove that a.a.s. we have X~ > 0 for any choice of T, V. 


Lemma 3.2. Given T C [to] of size t and an ordered sequence V = (y\, V 2 , ■ ■ ■, Vk- 1 ) of distinct 
vertices, where Vi £ [to] \ T for each 1 < i < k — 1, define X~ as above. Then 

P[(X” = 0) n A] < 2n~ 2t . 

Proof. Write S := S(K k ). Note that 


(4) 


t 4 ~ 1 n v - k p h - 1 


n~t-k+l\ h _ d 
v-d-k + l) t 
(log to ) 3 _ cf -1 


d d v v d d v v C2 1 


j.j.d—ly^y—d—k-\-lph—d 
- dd v v 

t(log n) 2 > 24/r 2 t(logn) 2 . 


Let < S'(H ntP ) be a hyperedge-disjoint collection of elements of S(H HjP ) of maximum size and let 
Yj := \S'(H ntP )\. In order to apply Theorem 12.21 we will estimate to, A and 77 . 

First we estimate to. Define 


to* := max E[J a / | I a = 1] 

a'GtS:a'~a 

and note that to < to*. We count the expected number of elements a' £ S(H n ^ p ) \ {a} sharing 
at least one hyperedge with some fixed element a £ S. Note that a and a' must share at least 
two vertices outside V. We let k + 1 < i + j < v denote the number of shared vertices, where 
i is the number of vertices shared in T. Consider any a' £ S \ {«} sharing i + j vertices with 
a. Let K be the hypergraph on i + j vertices formed by the set of hyperedges shared by a and 
a'. Let K' be the hypergraph on i + j vertices obtained from K by adding all hyperedges of the 
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form VUx for each of the i vertices x E T shared by a and a'. Since K' C F, e(K') < hi+j and 
so a and a' can have at most hi+ 3 — i common hyperedges. Then 

V 

v < v* < Y v i +H d - i n v - d - j p h - d -( hi +J - i ) 
i-\-j=k-\- 1 
v 

= vi+j {c 1 {\ogn)^) d - i n v -( i+ rip h - hi +i 1 d(n~ s ) = o(l). 

i+j=k-\-l 

Since A counts the expected number of ordered pairs of elements in S(H n:P ) which share at least 
one hyperedge, we have 

A < piv* = o(n 1 ). 

Finally, the probability of a fixed element in S being present in H n ^ p is given by 

n = p h - d = o{ l). 

So we can apply Theorem 12.21 to see that 

0 

(5) F\Yi < ni/2] < e -^/ 10 < n~ 2t . 

We define a cluster ( a,F') to be the union of an element a E S'{H n ^ p ) and a copy F' of F 
in H n p which share at least one hyperedge. Note that deleting F' from H n p to form H~ p will 
destroy a. 

We define an auxiliary graph G as follows. For each element of S\H np ) which lies in a cluster, 
choose one. These clusters form the vertices of G. Draw an edge between two vertices in G if 
the corresponding clusters share a hyperedge. We will use that 

(6) |G| < (A(G) + l)a(G) 

(which holds for all graphs) to bound the number of vertices in G. We will show that with 
sufficiently high probability |G| < Yf. (This in turn implies that at least one element of S\H n ^ p ) 
will remain in H~ p , i.e. X~ > 0.) 

First, we bound a(G). Let X 2 be the number of clusters in H n)P . We estimate p ,2 '■= E[X 2 ], 
breaking the sum into parts depending on the number i of vertices shared by a and F' in each 
cluster (a, F'). For k + 1 < i < v, we use that a and F intersect in a proper subgraph of F and 
thus can have at most hi common hyperedges. The first term in our bound on ^2 corresponds 
to those clusters (a, F') where a and F' share exactly one hyperedge: 

V 

P2 = E[x 2 ] < p 1 h 2 n v - k p h ~ 1 + Y W i n v ~ i P h ~ hi 

i=k+ 1 

(7) < n\h 2 n v ~ k p h ~ 1 + 0(pin~ s ) < p,i/(l2e 2 h 2 log re). 

Let Y 2 be a largest hyperedge-disjoint collection of clusters in FL n ^ p . We note that a(G) < Y 2 
and use Theorem EH to bound Y 2 : 

P [a(G) > pi/(12h 2 logn)] < P [l 2 > /ri/(12/i 2 log n)] < l°g ra 

\ Mi 

g-Pl/(12/i 2 log n) ® n -21' 


^ [12/(p 2 h? logn) 


(8) 
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We now bound A(G). Assume that A holds, that is, H n p does not contain a (logn, /)-cluster 
for any hyperedge /. Fix some hyperedge / £ E(H ntP ). Let E be a collection of clusters (on, Fi) 
such that / £ E((ai, Fi)) for each i and a* ^ aj if i A j. Suppose that IJ 7 ) >h logn+1. For each 
cluster ( ai,Fi ) in E, let e* be a hyperedge shared by on and Fi. The a* are hyperedge-disjoint, 
so / £ E(Fi ) for all but at most one cluster (ctj,! 7 )) £ E where / £ E(ati). If E contains such 
a cluster, delete it. Then, starting with i = 1, if (ai,Fi) has not already been deleted, delete 
from E any clusters ( otj,Fj) with j > i such that ej lies in ( on,Fi ). Do this for each i in turn. 
Since the a* are hyperedge-disjoint, at each step we delete at most h — 1 clusters from E. So a 
collection E' C E of at least logn clusters remains. But the set of all Fi such that (ctj,! 7 )) £ E' 
contains a (logn,/)-cluster in R U)P . Therefore, |.F| < hlogn + 1. Since every cluster has less 
than 2 h hyperedges, we must have 

(9) A (G) < 2/i 2 logn. 

So, if A holds, if a(G) < /ii/(12 h 2 logn) and if |Yi| > /ri/2, then 

mm 

|G| < (2/i 2 logn-f l)/Ui/(12/i 2 logn) </ii/4 < |li|. 

Thus, 

EL© 

F[(X- = 0) n A] = P[(|G| = Yl) n A}< P[Yi < m/2] + P[a(G) > a*i/(12 h 2 logn)] < 2 n~ 2t , 
as desired. □ 

3.4. Combining the bounds. We now use Lemmas 13.11 and 13.21 to prove Theorem ll.il 

Proof of Theorem 11.11 Define B to be the event that there exist T C [n] of size t and 
an ordered sequence V = (v\,V 2 , ■ ■ ■ ,Dc-i) of distinct vertices such that Vi £ [n] \ T for each 
1 < i < fc — 1 and X~ = 0. As remarked before Lemma 13.21 A^_i(i2 T j i) > t implies B. So we 
can apply Lemmas 13.11 and 13.21 to see that 

P[Afc_i(i2 Tli i) >t]< P [B] < F[A C ] + P [A n B}< n~ k + n t+k - 1 (2n~ 2t ) = o(l). 

This completes the proof of Theorem 11.11 □ 
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